Best Sparse Activation AI Tools & Models - Premium Sparse Activation News

AI News

Xiaomi Open Sources 309 Billion Parameter MiMo-V2-Flash Large Model, Inferencing Speed Outperforms Mainstream Competitors, API as Low as $0.1 per Million Tokens

Xiaomi releases the open-source large model MiMo-V2-Flash, which is designed for high speed and efficiency, showing outstanding performance in tasks such as inference and code generation, with response speed surpassing multiple popular domestic models. The model adopts a sparse activation architecture, with 309 billion parameters, and the weights and code are open-sourced under the MIT license.

32.7k 4 hours ago

ByteDance's UltraMem Architecture Reduces Large Model Inference Costs by 83%

The ByteDance Doubao large model team announced today the successful development of a new sparse model architecture called UltraMem. This architecture effectively addresses the high memory access issues during the inference of MoE (Mixture of Experts) models, improving inference speed by 2 to 6 times compared to MoE, and reducing inference costs by up to 83%. This groundbreaking advancement opens a new path for efficient inference of large models. The UltraMem architecture successfully resolves the memory bottleneck during inference of MoE architectures while maintaining model performance. Experimental results show that the parameters and activation conditions are the same.

22.9k 3 days ago

ByteDance Releases Doubao Large Model 1.5 Pro, Performance Surpassing GPT-4o and Claude3.5Sonnet

ByteDance officially launches its latest Doubao large model 1.5 Pro (Doubao-1.5-pro), which demonstrates outstanding comprehensive capabilities in various fields, successfully surpassing the well-known GPT-4o and Claude3.5Sonnet in the industry. The release of this model marks an important step forward for ByteDance in the field of artificial intelligence. Doubao 1.5 Pro adopts a novel sparse MoE (Mixture of Experts) architecture, utilizing a smaller set of activation parameters for pre-training. This design's innovation...

87.7k 23 hours ago

Is AI Finally Getting Smarter? MIT Researchers Discover ‘Brain Regions’ in Large Models!

Is AI actually starting to ‘get smarter’?! The latest research from MIT indicates that the internal structure of large language models (LLMs) astonishingly resembles that of the human brain! This study utilized sparse autoencoder techniques to conduct an in-depth analysis of the activation space of LLMs, revealing three hierarchical structural features that are truly remarkable: first, at the microscopic level, researchers discovered the existence of structures akin to ‘crystals’. The surfaces of these ‘crystals’ are composed of parallelograms or trapezoids, similar to familiar lexical analogies such as ‘man:’

15.5k yesterday

Models

Qwen3-Next-80B-A3B-Instruct

Alibaba

Input tokens/M

Output tokens/M

256

Context Length

Doubao-1.5-pro-32k

Bytedance

$0.8

Input tokens/M

Output tokens/M

128

Context Length

Spark X1

Iflytek

Input tokens/M

Output tokens/M

Context Length

Spark Medical Large Model - Lite

Iflytek

Input tokens/M

Output tokens/M

Context Length

Spark Tiny

Iflytek

Input tokens/M

Output tokens/M

Context Length

Gemini 2.0 Flash Thinking

Google

Input tokens/M

Output tokens/M

Context Length

Spark Max

Iflytek

Input tokens/M

Output tokens/M

Context Length

Spark Mini Instruct

Iflytek

Input tokens/M

Output tokens/M

Context Length

Spark Lite Patch

Iflytek

Input tokens/M

Output tokens/M

Context Length

Spark Mini

Iflytek

Input tokens/M

Output tokens/M

Context Length

Gemini 1.5 Pro

Google

$17.5

Input tokens/M

$70

Output tokens/M

2.1k

Context Length

Baichuan-7B

Baichuan

Input tokens/M

Output tokens/M

Context Length

Doubao-1.5-pro-256k

Bytedance

Input tokens/M

Output tokens/M

256

Context Length

Empowering the future, your artificial intelligence solution think tank

English 简体中文繁體中文にほんご

FirendLinks:

AI Newsletters AI Tools MCP Servers AI News AI Marketing LLM Leaderboard AI Ranking

Business Cooperation Site Map

AI News

Xiaomi Open Sources 309 Billion Parameter MiMo-V2-Flash Large Model, Inferencing Speed Outperforms Mainstream Competitors, API as Low as $0.1 per Million Tokens

ByteDance's UltraMem Architecture Reduces Large Model Inference Costs by 83%

ByteDance Releases Doubao Large Model 1.5 Pro, Performance Surpassing GPT-4o and Claude3.5Sonnet

Is AI Finally Getting Smarter? MIT Researchers Discover ‘Brain Regions’ in Large Models!

Models

Qwen3-Next-80B-A3B-Instruct

Doubao-1.5-pro-32k

Spark X1

Spark Medical Large Model - Lite

Spark Tiny

Gemini 2.0 Flash Thinking

Spark Max

Spark Mini Instruct

Spark Lite Patch

Spark Mini

Gemini 1.5 Pro

Baichuan-7B

Doubao-1.5-pro-256k

Qwen3 Next 80B A3B Instruct AWQ 8bit

MoE LLaVA Qwen 1.8B 4e

Relu 35B